REMARKS 



Claims 1-18 and 20-48 were pending in this application. Claims 1-14 and 33-47 
have been withdrawn from consideration as being directed to non-elected inventions. 
Claim 27 has been amended to clarify its content. 

Support for amended claim 27 can be found on page 35, lines 7-24 and on page 
62, lines 7-1 1 of the instant application. Thus, as a result of the foregoing amendment, 
claims 15-18, 20-32 and 48 remain for consideration. 

No new matter has been added by way of this amendment. Reconsideration of 
this application is respectfully requested. 

Claim Rejections under 35 U.S.C. §112, second paragraph 
Claims 15-32 are rejected under 35 U.S.C. §112, second paragraph, as being 
indefinite. The Examiner alleges that claim 27 recites the limitation "wherein the two 
homologous nucleotide sequences are IRESEGFP" and that the exact nucleotide 
sequence encompassed by this term is not described in the instant application and does 
not appear to be a consensus sequence known in the art. 

Applicants respectfully traverse this rejection and have amended claim 27 to 
clarify the usage of the term "IRESEGFP". Furthermore, Applicants provide for the 
Examiner's convenience a copy of two US Patents, Nos. 5,491,084 and 5,777,079 drawn 
to green fluorescent proteins and their uses. Applicants respectfully refer the Examiner to 
the filing date of both issued patents to clearly point out to the Examiner that such marker 
proteins, and the nucleic acid sequences encoding such proteins were known to those 
skilled in the art at the time the present application was filed. Applicants believe that the 
claim amendment as well as the support for the level of skill in the art at the time the 
instant application was filed would thus obviate the rejection of this claim as well as any 
dependent claims. In light of the foregoing, Applicants respectfully request withdrawal of 
the rejection. 
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Double Patenting 

Claims 15-18, 20, 25-26, 29, and 48 have been rejected under the judicially 
created doctrine of obviousness-type double patenting as being unpatentable over claims 1, 
and 6-7 of U.S. Patent No. 6,485,912. Applicants respectfully traverse the Examiner's 
rejection and herewith file a Terminal Disclaimer in compliance with 37 CFR 1.321(c), 
disclaiming the term of any patent granted on the instant application beyond that of the 
aforementioned patent. In light of the foregoing, withdrawal of the rejection is respectfully 
requested. 

Fees 

No fees are believed to be required, but if so, the Commissioner is hereby 
authorized to charge any fees, or credit any overpayment, to Deposit Account No. 11- 
1153. 

Conclusion 

Applicants believe that the foregoing amendments to the claims and the 
submission of a Terminal Disclaimer place the application in condition for allowance. 
Withdrawal of the rejections is respectfully requested. If a discussion with the 
undersigned will be of assistance in resolving any remaining issues, the Examiner is 
invited to telephone the undersigned at (201) 487-5800, ext. 1 18, to effect a resolution. 



KLAUBER & JACKSON 
411 Hackensack Avenue 
Hackensack, NJ 07601 
(201) 487-5800 
Date: July 9, 2003 



Respectfully submitted, 



By 



Veronica Mallon, Ph.D. 
Agent for Applicant(s) 
Reg. No. 52,491 



Enclosures: Terminal Disclaimer 

US Patent Nos: 5,491,084 and 5,777,079 
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[57] ABSTRACT 

This invention provides a cell comprising a DNA molecule 
having a regulatory element from a gene, other than a gene 
encoding a green-fluorescent protein operatively linked to a 
DNA sequence encoding the green-fluorescent protein. This 
invention also provides a method for selecting cells express- 
ing a protein of interest which comprises: a. introducing into 
the cells a DNAI molecule having DNA sequence encoding 
the protein of interest and DNAII molecule having DNA 
sequence encoding a green-fluorescent protein; b. culturing 
the introduced cells in conditions permitting expression of 
the green-fluorescent protein and the protein of interest; and 
c. selecting the cultured cells which express green-fluores- 
cent protein, thereby selecting cells expressing the protein of 
interest. Finally, this invention provides various uses of a 
green-fluorescent protein. 

12 Claims, 3 Drawing Sheets 
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USES OF GREEN-FLUORESCENT PROTEIN 

The invention disclosed herein was made with Govern- 
ment support under NIH Grant No. 5R01GM30997 from the 
Department of Health and Human Services. Accordingly, the 5 
U.S. Government has certain rights in this invention. 

BACKGROUND OF THE INVENTION 

Throughout this application various references are 
referred to within parenthesis. Disclosures of these publica- 10 
tions in their entireties are hereby incorporated by reference 
into this application to more fully describe the state of the an 
to which this invention pertains. Full bibliographic citation 
for these references may be found at the end of this appli- 
cation, preceding the sequence listing and the claims. 15 

Several methods are available to monitor gene activity 
and protein distribution within cells. These include the 
formation of fusion proteins with coding sequences for 
p-galactosidase (21), and luciferases (21). The usefulness of 2Q 
these methods is often limited by the requirement to fix cell 
preparations or to add exogenous substrates or cofactors. 
This invention disclose a method of examining gene expres- 
sion and protein localization in living cells that requires no 
exogenously-added compounds. 25 

This method uses a cDNA encoding the Green-Fluores- 
cent Protein (GFP) from the jelly fish Aequorea victoria (5). 
In A. victoria, GFP absorbs light generated by aequorin upon 
the addition of calcium and emits a green light. 

This invention discloses that GFP expressed in prokary- 30 
otic and eukaryotic cells is capable of producing a strong 
green fluorescence when excited near UV or blue light. 
Since this fluorescence requires no additional gene products 
from A. victoria, chromophore formation is not species 
specific. 35 

SUMMARY OF THE INVENTION 

This invention provides a cell comprising a DNA mol- 
ecule having a regulatory element from a gene, other than a 
gene encoding a green- fluorescent protein operatively linked 40 
to a DNA sequence encoding the green-fluorescent protein. 

This invention provides a method for selecting cells 
expressing a protein of interest which comprises: a) intro- 
ducing into the cells a DNAI molecule having DNA 45 
sequence encoding the protein of interest and DNAII mol- 
ecule having DNA sequence encoding a green-fluorescent 
protein; b) culturing the introduced cells in conditions 
permitting expression of the green-fluorescent protein and 
the protein of interest; and c) selecting the cultured cells 5Q 
which express green-fluorescent protein, thereby selecting 
cells expressing the protein of interest. 

This invention also provides a method for localizing a 
protein of interest in a cell: a. introducing into a cell a DNA 
molecule ' having DNA sequence encoding the protein of 55 
interest linked to DNA sequence encoding a green-fluores- 
cent protein such that the protein produced by the DNA 
molecule will have the protein of interest fused to the 
green-fluorescent protein; b. culturing the cell in conditions 
permitting expression of the fused protein; c. detecting the ^ 
location of the green-fluorescent protein in the cell, thereby 
localizing a protein of interest in a cell. 

BRIEF DESCRIPTION OF FIGURES 

FIG. 1 Expression of GFP in E. coll The bacteria on the 65 
right side of the figure have the GFP expression plasmid. 
This photograph was taken while irradiating the plate with 
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a hand-held long-wave UV source. 

FIG. 2 Excitation and Emission Spectra of E. co/f-gener- 
ated GFP (solid lines) and purified A. victoria GFP (L form; 
dotted lines). 

FIG. 3 Expression of GFP in a first stage Caenorhabditis 
elegans larva. Two touch receptor neurons (PLML and 
ALML) and one other neuron of unknown function (ALNL) 
are indicated. Processes can be seen projecting from all three 
cell bodies. The arrow points to the nerve ring branch from 
the ALML cell (out of focus). The background fluorescence 
is due to the animal's autofluorescence. 

DETAILED DESCRIPTION OF THE 
INVENTION 

Throughout this application, the following standard 
abbreviations are used to indicate specific nucleotides: 



C = cytosine A = adenosine 

T = thymidine G = guanosine 



This invention provides a cell comprising a DNA molecule 
having a regulatory element from a gene, other than a gene 
encoding a green-fluorescent protein operatively linked to a 
DNA sequence encoding the green-fluorescent protein. 

This invention provides a cell comprising a DNA mol- 
ecule having a regulatory element from a gene, other than a 
gene encoding a green-fluorescent protein operatively linked 
to a DNA sequence encoding the green-fluorescent protein, 
wherein the cell is selected from a group consisting essen- 
tially of bacterial cell, yeast cell, fungal cell, insect cell, 
nematode cell, plant or animal cell. 

Suitable animal cells include, but are not limited to Vero 
cells, HeLa cells, Cos cells, CV1 cells and various primary 
mammalian cells. 

In an embodiment, the bacterial cell is Escherichia coli 

As used herein, "a regulatory element" from a gene is the 
DNA sequence which is necessary for the expression of the 
gene. 

In this invention, the term "operatively linked" to means 
that following such a link the regulatory element can direct 
the expression of the linked DNA sequence which encodes 
a green-fluorescent protein. 

The gene encoding a green-fluorescent protein includes 
DNA molecules coding for polypeptide analogs, fragments 
or derivatives of antigenic polypeptides which differ from 
naturally-occurring forms in terms of the identity or location 
of one or more amino acid residues (deletion analogs 
containing less than all of the residues specified for the 
protein, substitution analogs wherein one or more residues 
specified are replaced by other residues and addition analogs 
where in one or more amino acid residues is added to a 
terminal or medial portion of the polypeptides) and which 
share some or all properties of naturally-occurring forms. 

These DNA molecules include: the incorporation of 
codons "preferred" for expression by selected non-mamma- 
lian hosts; the provision of sites for cleavage by restriction 
endonuclease enzymes; and the provision of additional 
initial, terminal or intermediate DNA sequences that facili- 
tate construction of readily expressed vectors. 

As an example, plasmid pGFPIO.l codes for a mutated 
GFP protein with the 80th codon changed from a glutamine 
to arginine. This mutated protein retains the property of the 
natural protein. 

In an embodiment, the regulatory element is a promoter. 
In a further embodiment, the promoter is activated by a 
heavy metal. Such promoters are well-known in the art (J. H. 
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Freedman, L. W. Slice, A. Fire, and C. S. Rubin (1993) 
Journal of Biological Chemistry, 268:2554). 

In another embodiment, the promoter is a P450 promoter. 
Cytochrome P450 is well-known in the art and there are a 
number of P450 promoters known. 5 

In still another embodiment, the promoter is for a stress 
protein. Such stress proteins are well-known in the art (E. G. 
Stringbam, D. K. Dixon, D. Jones and E. D. Candido (1992) 
Molecular Biology of the Cell 3:221; and William J. Welch 
(May, 1993), Scientific American, page 56). In a further 10 
embodiment, the stress protein is a heat-shock protein. 

Triis invention provides a cell comprising a DNA mol- 
ecule having a regulatory element from a gene, other than a 
gene encoding a green-fluorescent protein operati vely linked 
to a DNA sequence encoding the green-fluorescent protein, is 
wherein the promoter is for a gene viable to the cell growth. 

In another embodiment, the regulatory element is an 
enhancer. Enhancers are well-known in the art. 

This invention provides a cell comprising a DNA mol- 
ecule having a regulatory element from a gene, other than a 20 
gene encoding a green-fluorescent protein operatively linked 
to a DNA sequence encoding the green-fluorescent protein, 
wherein the DNA sequence encoding the Aequorea victoria 
green-fluorescent protein. 

In an embodiment, the Aequorea victoria green- 25 
fluorescent protein is cloned in a plasmid. This plasmid is a 
modification of the pBS(+) (formerly called pBluescribe +) 
vector (Stratagene®) which has inserted within it an Eco RI 
fragment containing the cDNA sequence of the Aequorea 
victoria green-fluorescent protein (as modified herein). The 30 
fragment was obtained from XGFP10 (Prasher, D. C, Eck- 
enrode, V. K., Ward, W. W., Prendergast, F. G., and Cormier, 
M. J., (1992) Primary structure of the Aequorea victoria 
green-fluorescent protein. Gene, 111:229-233) by amplifi- 
cation using the polymerase chain reaction (Saiki, R. K., 35 
Gelfand, D. H., Stoffel, S., Sharf, S. J., Higuchi, G. T, Horn, 
G. X, Mullis, K. B., and Erlich, H. A. (1988) Primer-directed 
enzymatic amplification of DNA with a thermostable DNA 
polymerase. Science, 239:487-491) with primers flanking 
the Eco RI sites and subsequent digestion with Eco RI. The 40 
sequence of the cDNA in pGFPlO.l differs from the pub- 
lished sequence (5) by a change of the 80th codon of the 
coding sequence from CAG to CGG, a change that produces 
a glutamine to arginine change in the protein sequence. 

This pGFPlO.l plasmid was deposited on Sep. 1, 1993 45 
with the American Type Culture Collection (ATCC), 12301 
Parklawn Drive, Rockville, Md. 20852, U.S.A. under the 
provisions of the Budapest Treaty for the International 
Recognition of the Deposit of Microorganism for the Pur- . 
poses of Patent Procedure. Plasmid pGFPlO.l was accorded 50 
ATCC Accession Number 75547. 

This invention provides a living organism comprising the 
cell comprising a DNA molecule having a regulatory ele- 
ment from a gene, other than a gene encoding a green- 
fluorescent protein operatively linked to a DNA sequence 55 
encoding the green-fluorescent protein. 

In an embodiment, the living organism is C. elegans. In 
another embodiment, the living organism is Drosophila, 
zebra fish or bacteriophage. 

A bacteriophage carrying the green-fluorescent protein 60 
. gene can infect a particular type of bacteria. The infection 
may be easily detected via the expression of the green- 
fluorescent protein. Therefore, by using appropriate bacte- 
riophages, the presence of that particular type of bacteria 
may be detected. 65 

Similarly, a virus carrying the green-fluorescent protein 
gene may infect a specific cell type. The infection may be 



easily detected by monitoring the expression of the green- 
fluorescent protein. 

The above-described cells and living organisms are useful 
to detect effects of external stimulus to the regulatory 
element. The stimulus may have direct or indirect effects on 
the regulatory element. Such effects will be detectable 
through either the induction of expression and production of 
the green-fluorescent protein or switching off the expression 
of the green-fluorescent protein. 

Cells expressing the green-fluorescent proteins may be 
conveniently separated by a fluorescence-activated cell 
sorter. 

These cells and organisms may be used to detect the 
presence of different molecules in various kinds of biologi- 
cal samples such as blood, urine or saliva. By operatively 
linking a regulatory element of the gene which is affected by 
the molecule of interest to a green-fluorescent protein, the 
presence of the molecules will affect the regulatory element 
which in turn will affect the expression of the green- 
fluorescent protein. Therefore, the above-described cells are 
useful for the detection of molecules. Such detection may be 
used for diagnostic purposes. An example of such a mol- 
ecule is a hormone. 

This invention provides a living organism comprising the 
cell comprising a DNA molecule having a regulatory ele- 
ment from a gene, other than a gene encoding a green- 
fluorescent protein operatively linked to a DNA sequence 
encoding the green-fluorescent protein, wherein the regula- 
tory element is for a stress protein. 

This invention provides a living organism comprising the 
cell comprising a DNA molecule having a regulatory ele- 
ment from a gene, other than a gene encoding a green- 
fluorescent protein operatively linked to a DNA sequence 
encoding the green-fluorescent protein, wherein the stress 
protein is a heat-shock protein. 

This invention provides a method for selecting cells 
expressing a protein of interest which comprises: a) intro- 
ducing into the cells a DNAI molecule having DNA 
sequence encoding the protein of interest and DNAII mol- 
ecule having DNA sequence encoding a green-fluorescent 
protein; b) culturing the introduced cells in conditions 
permitting expression of the green-fluorescent protein and 
the protein of interest; and c) selecting the cultured cells 
which express green-fluorescent protein, thereby selecting 
cells expressing the protein of interest 

This invention also provides the above method, wherein 
the cells are selected from a group consisting essentially of 
bacterial cells, yeast cells, fungal cells, insect cells, nema- 
tode cells, plant or animal cells. Suitable animal cells 
include, but are not limited to Vero cells, HeLa cells, Cos 
cells, CV1 cells and various primary mammalian cells. 

In an embodiment, DNAI and DNAII are linked. In 
another embodiment, the DNA encodes the Aequorea vic- 
toria green-fluorescent protein. 

This invention provides a method for localizing a protein 
of interest in a cell which comprises: a) introducing into a 
cell a DNA molecule having DNA sequence encoding the 
protein of interest linked to DNA sequence encoding a 
green-fluorescent protein such that the protein produced by 
the DNA molecule will have the protein of interest fused to 
the green-fluorescent protein; b) culturing the cell in condi- 
tions permitting expression of the fused protein; and c) 
detecting the location of the green-fluorescent protein in the 
cell, thereby localizing a protein of interest in a cell. 

Regulatory elements required for expression include pro- 
moter sequences to bind RNA polymerase and transcription 
initiation sequences for ribosome binding. For example, a 
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bacterial expression vector includes a promoter such as the 
lac promoter and for transcription initiation the Shine- 
Dalgarno sequence and the start codon AUG. Similarly, a 
eukaryotic expression vector includes a heterologous or 
homologous promoter for RNA polymerase II, a down- 5 
stream polyadenylation signal, the start codon AUG, and a 
termination codon for detachment of the ribosome. Such 
vectors may be obtained commercially or assembled from 
the sequences described by methods well-known in the art, 
for example the methods described above for constructing lQ 
vectors in general. 

The host cell of the above expression system may be 
selected from the group consisting of the cells where the 
protein of interest is normally expressed, or foreign cells 
such as bacterial cells (such as E. coif), yeast cells, fungal 
cells, insect cells, nematode cells, plant or animal cells, 15 
where the protein of interest is not normally expressed. 
Suitable animal cells include, but are not limited to Veto 
cells, HeLa cells, Cos cells, CV1 cells and various primary 
mammalian cells. 

In an embodiment of the method for localizing a protein 20 
of interest in a cell, the DNA encoding the green-fluorescent 
protein is from Aequorea victoria. 

This invention provides a method for localizing a protein 
of interest in a cell which comprises: a) introducing into a 
cell a DNA molecule having DNA sequence encoding the 25 
protein of interest linked to DNA sequence encoding a 
green-fluorescent protein such that the protein produced by 
the DNA molecule will have the protein of interest fused to 
the green-fluorescent protein; b) culturing the cell in condi- 
tions permitting expression of the fused protein; and c) 30 
detecting the location of the green-fluorescent protein in the 
cell, thereby localizing a protein of interest in a cell, wherein 
the cell normally expressing the protein of interest. 

This invention provides a method for detecting expression 
of a gene in a cell which comprises: a) introducing into the 35 
cell a DNA molecule having DNA sequence of the gene 
linked to DNA sequence encoding a green-fluorescent pro- 
tein such that the regulatory element of the gene will control 
expression of the green- fluorescent protein; b) culturing the 
cell in conditions permitting expression of the gene; and c) 40 
detecting the expression of the green-fluorescent protein in 
the cell, thereby indicating the expression of the green in the 
cell. 

This invention provides a method for indicating expres- 
sion of a gene in a subject which comprises: a) introducing 45 
into a cell of the subject a DNA molecule having DNA 
sequence of the gene linked to DNA sequence encoding a 
green-fluorescent protein such that the regulatory element of 
the gene will control expression of the green-fluorescent 
protein; b) culturing the cell in conditions permitting expres- 50 
sion of the fused protein; and c) detecting the expression of 
the green-fluorescent protein in the cell, thereby indicating 
the expression of the gene in the cell. 

In an embodiment of the above methods, the green- 
fluorescent protein is the Aequorea victoria green- 55 
fluorescent protein. 

This invention provides a method for determining the 
tissue-specificity of a DNA sequence in a subject which 
comprises: a) introducing into a cell of the subject a DNA 
molecule having the DNA sequence linked to DNA 60 
sequence encoding a green-fluorescent protein such that the 
DNA sequence will control expression of the green-fluores- 
cent protein; b) culturing the subject in conditions permitting 
the expression of the green-fluorescent protein; and c) 
detecting the expression of the green-fluorescent protein in 65 
dilFerent tissue of the subject, thereby determining the 
tissue-specificity of the DNA sequence. 
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This invention provides a method for deterrnining the 
presence of heavy metal in a solution which comprises: a) 
culturing the cell comprising a DNA molecule having a 
promoter from a gene, other than a green-fluorescent protein 
operatively linked to a DNA sequence encoding the green- 
fluorescent protein, wherein the promoter is activated by a 
heavy metal in the solution; and b) detecting expression of 
the green-fluorescent protein, the expression of the green- 
fluorescent protein indicates the presence of heavy metal. 

This invention provides a method for detecting pollutants 
in a solution which comprises: a) culturing the cell com- 
prising a DNA molecule having a promoter from a gene, 
other than a green-fluorescent protein operatively linked to 
a DNA sequence encoding the green-fluorescent protein, 
wherein the promoter is activated by a heavy metal or the 
promoter is for a stress protein in the solution; and b) 
detecting expression of the green-fluorescent protein, the 
expression of the green-fluorescent protein indicates the 
presence of heavy metal. 

This invention will be better understood from the Experi- 
mental Details which follow. However, one skilled in the art 
will readily appreciate that the specific methods and results 
discussed are merely illustrative of the invention as 
described more fully in the claims which follow thereafter. 

EXPERIMENTAL DETAILS 

Light is produced by the bioluminescent jellyfish Aequo- 
rea victoria when calcium binds to the photoprotein 
aequorin (1). Although activation of aequorin in vitro or in 
heterologous cells produces blue light, the jellyfish produces 
green light This latter light is due to the presence of a 
second protein in A. victoria called the green-fluorescent 
protein (GFP) that absorbs the energy from aequorin (2). 

Purified GFP absorbs blue light (maximally at 395 nm 
with a minor peak at 470 m) and emits green light (peak 
emission at 509 nm with a shoulder at 540 nm) (2, 3). This 
fluorescence is very stable; virtually no photobleaching is 
observed (4). As recently deduced from a cDNA clone from 
A. victoria, GFP is a protein of 238 amino acids (5). 
Although the intact protein is needed for fluorescence, the 
same absorption spectral properties as the denatured protein 
are found in a hexapeptide starting at amino acid 64 (6, 7). 
The GFP chromophore is derived from the primary amino 
acid sequence through the cyclization of Ser-dehydroTyr- 
Gly within this hexapeptide (7). The mechanisms that pro- 
duce the dehydrotyrosine and cyclize the polypeptide to 
form the chromophore are unknown. To determine whether 
additional factors from A. victoria were needed for the 
production of the fluorescent protein, applicants tested GFP 
fluorescence in heterologous systems. Applicants show here 
that GFP expressed in prokaryotic and eukaryotic cells is 
capable of producing a strong green fluorescence when 
excited by blue light. Since this fluorescence requires no 
additional gene products from A. victoria, chromophore 
formation is not species specific and occurs either through 
the uses of ubiquitous cellular components or by autoca- 
talysis. 

Expression of GFP in Escherichia coli (8) results in an 
easily detected green fluorescence (9) that is not seen in 
control bacteria. For example, florescent bacteria were easily 
seen on plates containing IPTG when they were illuminated 
with a long-wave UV source (FIG. 1). Since the cells grow 
well in the continual presence of the inducer (IPTG), GFP 
does not appear to have a toxic effect on the cells. When GFP 
was partially purified from this strain (10), it was found to 
have fluorescence excitation and emission spectra indistin- 
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guishable form those of the purified native protein (FIG. 2). 
The spectral properties of the recombinant GFP suggest that 
the chromophore can form in the absence of other A. victoria 
products. 

Transformation of the nematode Caenorhabditis elegans 5 
also resulted in the production of fluorescent GFP (11) (PIG. 
3). GFP expression was directed to a relatively small number 
of neurons using a promoter for the mec-7 gene. This gene 
encodes a p-tubulin (12) that is expressed strongly in six 
touch receptor neurons in C. elegans and weakly in a few 10 
other neurons (13, 14). The pattern of expression of GFP was 
similar to that seen with the anti-MEC-7 antibody or from 
mec-7 lacZ fusions (13-15). The strongest fluorescence was 
seen in the cell bodies of the four embryonically-derived 
touch receptor neurons (ALML, ALMR, PLML, PLMR). 15 
The processes from these cells, including their terminal 
branches, were often easily seen in larval animals. In older 
larvae, the cell bodies of the rerriaining touch cells (AVM 
and PVM) were also seen; the processes of these cells were 
more difficult to detect. These postembryonically-derived 20 
cells arise during the first of the four larval stages (16), but 
their outgrowth occurs in the following larval stages (17) 
with the cells becoming functional during the fourth larval 
stage (18). GFP's fluorescence in these cells is consistent 
with these previous results; no newly hatched or late first- 25 
stage larvae, and seven of eight young adults had at least one 
of these cells (19). In addition, moderate to weak fluores- 
cence was seen for a few other neurons (FIGS. 3, and 20). 

The expression of GFP in both E. coli and C. elegans 
differs from that of the native protein in one respect. 30 
Although relatively stable when illuminated using low inten- 
sities of 470 nm light, the fluorescence photobleaches rap- 
idly (within seconds) when the cells are illuminated with 
light of approximately 395 nm. Applicants do not know 
whether this difference is caused by the alteration of the 35 
protein-coding sequence (8), by the absence of a necessary 
post-translational modification, or by non-specific damage 
within the cells. However, when cells in C. elegans have 
been photobleached, some recovery is seen within 10 min- 
utes. Further investigation is needed to determine whether 40 
this recovery represents de novo synthesis of GFP. 

Several methods are available to monitor gene activity 
and protein distribution within cells. These include the 
formation of fusion proteins with coding sequences for 45 
p-galactosidase, firefly luciferase, and bacterial luciferase 
(21). Because these methods require exogenously-added 
substrates or cofactors, they are of limited use with living 
tissue. Since detection of intracellular GFP requires only the 
radiation by near UV or blue light, it is not substrate limited. 5Q 
Thus, it should provide an excellent means of monitoring 
gene expression and protein localization in living cells (22). 

Because it does not appear to interfere with cell growth 
and function, GFP should also be a convenient indicator of 
transformation (and one that could allow cells to be sepa- 55 
rated using fluorescence-activated cell sorting). Applicants 
also envision that GFP can be used as a vital marker so that 
cell growth (for example, the elaboration of neuronal pro- 
cesses) and movement can be followed in situ, especially in 
animals that are essentially transparent like C. elegans and 60 
zebra fish. 

References and notes 

1. Shimomura, O., Johnson, F. H., Saiga, Y., (1962) /. 
Cell Comp. Physiol, 59:223. 65 

2. Morise, H., Shimomura, O., Johnson, E H. and Winant, 
J. (1974), Biochemistry, 13:2656. 



3. Ward, W. W., Cody, C. W., Hart, R. C. and Cormier, M. 
J., (1980), Photochem. Photobiol, 31:611. 

4. Prendergast, F. G., personal communication. 

5. Prasher, D. C, Eckenrode, V. K. ? Ward, W. W., Pren- 
dergast, F. G., and Cormier, M. J., (1992), Gene, 111:229. 

6. Shimomura, A., (1979), FEBS Lett., 104:220. 

7. Cody, C. W., Prasher, D. C, Westler, W. M. t Prender- 
gast,F. G., and Ward, W. W., (1993), Biochemistry, 32:1212. 

8. Plasmid pGFPlO.l was formed by placing the Eco RI 
fragment encoding the GFP cDNA from XgflplO (5) into 
pBS(+) (Stratagene®). The fragment was obtained by ampli- 
fication using the polymerase chain reaction (PCR; R. K. 
Saiki, et al., Science, 239:487 (1988)) with primers flanking 
the Eco RI sites and subsequent digestion with Eco RI. The 
sequence of the cDNA in pGFPlO.l differs from the pub- 
lished sequence by a change in codon 80 within the coding 
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translation and an Eco RI site 5' primer was ACAAAG- 
GCTAGCAAAGGAGAAGAAC (Sequence ID No.: 1) and 
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pET3a [A. H. Rosenberg., et al., Gene, 56:125, (1987)] by 
standard methods [J. Sambrook, E. F., Fritsch, and T. Mania- 
tis, Molecular cloning: A laboratory manual, 2nd ed., Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 
(1989)]. The E. coli strain BL21 (DE3)Lys [F. W. Studier and 
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were grown on nutrient plates containing 100 ug/ml ampi- 
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0.1 mM PMSF, 1 ug/ml pepstatin A, 1 ug/ml leupeptin, and 
2 ug/ml aprotinin, and centrifuge at 5,000 rpm for 5' in the 
cold. The supernatant was centrifuged a second time (15,000 
rpm for 15') and then diluted seven fold with 10 mM Tris pH 
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and emission spectra were obtained using a SPEX FTTH 
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from A. victoria (M. Cutler, A. Roth and W. W. Ward, 
unpublished data). The excitation spectra, were measured 
from 300-500 nm with a fixed emission wavelength of 509 
nm and the emission spectra, were measured from 410-600 
nm with a fixed excitation of 395 nm. All spectra were 



5,491,084 



10 



10 



15 



20 



25 



recorded as signal/reference data (where the reference is a 
direct measurement of the lamp intensity using a separate 
photomultiplier tube) at room temperature with 1 second 
integration time and 1 nm increment. The spectral hand 
width were adjusted to 0.94 nm for all spectra. 5 

11. Wild- type and mutant animals were grown and genetic 
strains were constructed according to S. Brenner, Genetics, 
77:71 (1974). 

The plasmid p3FP10.1 was used as a template in PCR to 
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was ligated into plasmid pPD 16.51 [12; A. Fire, S. W. 
Harrison, and D. Dixon, Gene, 93:189 (1990)], a vector 
containing the promoter of the C. elegans mec-7 gene. 
Wild-type C. elegans were transformed by coinjecting this 
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For C. elegans, applicants find that a long-pass emission 3Q 
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was strong like in the uls4 animals). The uls4 animals, 
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that specifies touch cell fate [13, 15, 18, J. C. Way and M. 
Chalfie, Cell, 54:5 (1988)]. It has been shown previously 
that mec-7 expression is reduced in the ALM touch cells of 
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I-Eco RI fragment encoding GFP with an Age I site 5' to the 
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SEQUENCE LISTING 



( 1 ) GENERAL INFORMATION: 



( i i i ) NUMBER OF SEQUENCES: 1 



( 2 ) INFORMATION FOR SEQ ID N0:1: 



( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 25 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDED NESS: single 
( D ) TOPOLOGY: linear 
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-continued 

( i i ) MOLECULE TYPE: cDNA 

( i i i ) HYPOTHETICAL: NO 

( v i ) ORIGINAL SOURCE: 

( A ) ORGANISM: Escherichia coli 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NOU: 

ACAAAOOCTA GCAAAGGAGA AGAAC 



What is claimed is: 

1. A host cell comprising a DNA molecule having a 
regulatory element from a gene, other than a gene encoding 15 
an Aequorea victoria green-fluorescent protein operauvely 
linked to a DNA sequence encoding the fluorescent Aequo- 
rea victoria green-fluorescent protein. 

2. A cell of claim 1, wherein the cell is selected from a 
group consisting of bacterial cell, yeast cell, fungal cell, 20 
plant cell or animal cell. 

3. A cell of claim 1, wherein the regulatory element is a 
promoter. 

4. A cell of claim 3, wherein the promoter is activated by 

a heavy metal. 25 

5. A cell of claim 3, wherein the promoter is a P450 
promoter. 

6. A cell of claim 3, wherein the promoter is from a gene 
encoding a stress protein. 

7. A cell of claim 6, wherein the stress protein is a 30 
heat-shock protein. 

8. A cell of claim 1, wherein the regulatory element is an 
enhancer. 

9. A method for selecting cells expressing a protein of 
interest which comprises: 35 

(a) introducing into the cells a DNAI molecule having 
DNA sequence encoding the protein of interest and 
DNAJ3 molecule having DNA sequence encoding an 
Aequorea victoria green-fluorescent protein; 

(b) culturing the introduced cells in conditions permitting 40 
expression of the Aequorea victoria green-fluorescent 
protein and the protein of interest; and 

(c) selecting the cultured cells which express Aequorea 
victoria green-fluorescent protein, thereby selecting 
cells expressing the protein of interest, 45 

wherein DNAI and DNAII are linked. 



2 5 



10. A method for selecting cells expressing a protein of 
interest which comprises: 

(a) introducing into the cells a DNAI molecule having 
DNA sequence encoding the protein of interest and 
DNAII molecule having DNA sequence encoding an 
Aequorea victoria green-fluorescent protein; 

(b) culturing the introduced cells in conditions permitting 
expression of the Aequorea victoria green-fluorescent 
protein and the protein of interest; and 

(c) selecting the cultured cells which express Aequorea 
victoria green-fluorescent protein, thereby selecting 
cells expressing the protein of interest, 

wherein the cells are selected from a group consisting of 
yeast cells, fungal cells, insect cells, nematode ceils, plant or 
animal cells. 

11. A method for localizing a protein of interest in a cell 
which comprises: 

(a) introducing into a cell a DNA molecule having DNA 
sequence encoding the protein of interest linked to 
DNA sequence encoding an Aequorea victoria green- 
fluorescent protein such that the protein produced by 
the DNA molecule will have the protein of interest 
fused to the Aequorea victoria green-fluorescent pro- 
tein; 

(b) culturing the cell in condition permitting expression of 
the fused protein; and 

(c) detecting the location of the fluorescence of the fused 
protein in the cell, thereby localizing a protein of 
interest in a cell. 

12. A method of claim 11, wherein the cell normally 
expresses the protein of interest 

***** 
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ABSTRACT 



Modifications in the sequence of Aequorea wild-type GFP 
provide products having markedly different excitation and 
emission spectra from corresponding products from wild- 
type GFP. In one class of modifications, the product derived 
from the modified GFP exhibits an alteration in the ratio of 
two main excitation peaks observed with the product derived 
from wild-type GFP. In another class, the product derived 
from the modified GFP fluoresces at a shorter wavelength 
than the corresponding product from wild-type GFP. In yet 
another class of modifications, the product derived from the 
modified GFP exhibits only a single excitation peak and 
enhanced emission relative to the product derived from 
wild-type GFP. 

64 Claims, 7 Drawing Sheets 
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MODIFIED GREEN FLUORESCENT wavelength discrimination between different fluorophores 

PROTEINS and detection of resonance energy transfer are easier if the 

component spectra are tall and narrow rather than low and 
This is a divisional application of U.S. application Ser. broad. Furthermore, the longer wavelength excitation peak 
No. 0&T727.452. filed Oct. 18. 1996. which is a continuation 5 (475 nm) of Aequorea GFP is almost ideal for fluorescein 
of PCT/US95/14692. filed Oct. 10, 1995, which is a filter sets and is resistant to photobleaching. but has lower 
continuation-in-part of U.S. application Ser. No. 08/337. amplitude than the shorter wavelength peak at 395 nm. 
915. filed Nov. 10.1994. which has issued as U.S. Pat No. which is more susceptible to photobleaching IChalfie et al. 
5.625.048. (1994). supra). For all these reasons, it would clearly be 
The invention was made with Government support under 10 advantageous to convert the Aequorea GFP excitation spec- 
Grant No. NS27177, awarded by the National Institute of trum to a single peak, and preferably at longer wavelengths. 
Health. The Government has certain rights in this invention. nw u ^ a necd in ftc m for protdns whjch fluore$ce 

BACKGROUND OF THE INVENTION at different wavelengths. Variants of fluorescent proteins 

This invention relates generally to the fields of biology „ with 001015 wouW 3150 * verv useful for simulta - 

and chemistry. More particularly, the invention is directed to ncous co«P»Kons of multiple protein fates, developmental 

modified fluorescent proteins and to methods for the prepa- lineages, and gene expression levels, 

ration and use thereof. Accordingly, it is an object of the present invention to 

In biochemistry, molecular biology and medical provide improved fluorescent proteins which do not suffer 

diagnostics, it is often desirable to add a fluorescent label to ^ from drawbacks of native Aequorea GFP. 
a protein so that the protein can be easily tracked and 

quantified The normal procedures for labeling requires that SUMMARY OF THE INVENTION 
the protein be covalently reacted in vitro with fluorescent jj, accordance with the present invention, it has been 
dyes, then repurified to remove excess dye and any damaged determined that particular modifications in the polypeptide 
protein. If the labeled protein is to be used inside cells, it ^ sequence of an Aequorea wild-type GFP |SEQ ID NO:2| 
usually has to be microinjected; this is a difficult and lead to formation of products having markedly different 
time-consuming operation that cannot be performed on large excitation and emission spectra from corresponding prod- 
numbers of cells. These problems may. however, be elimi- ucts derived from wild-type GFP. Visibly distinct colors 
nated by joining a nucleotide sequence coding for the and/or increased intensities of emission make these products 
protein of interest with the sequence for a naturally fluores- ^ use ful in a wide variety of contexts, such as tracking of 
cent protein, then expressing the fusion protein. differential gene expression and protein localization. 

The green fluorescent protein (GFP) of the jellyfish 

Aequorea victoria is a remarkable protein with strong visible BRIEF DESCRIPTION OF THE DRAWINGS 

absorbance and fluorescence from a _ . . t , . , 

p-hdroxybenzyUdeneimidazolone chromophore. which is 35 J** Inventl0n *** * ^stood with reference to 

generated by cyclization and oxidation of the protein's own the *™™f>™ymg drawings, in which: 

Ser-TVT-Gly sequence at positions 65 to 67. A cDNA 1 compares different versions of GFP by gel eiec- 

sequence |SEQ ID NO: 1 1 for one isotype of GFP has been trophoresis and Coomassie blue staining; 

reported (Prasher. D. C ct al.. Gene 111. 229-233 (1992)|; FIG. 2 illustrates a proposed biosynthetic scheme for 

cloning of this cDNA has enabled GFP expression in dif- 40 GFP; 

ferent organisms. The finding that the expressed protein FIGS. 3o and 36 illustrate the excitation and emission 

becomes fluorescent in cells from a wide variety of organ- spectra of wild-type and a first group of mutant GFPs: 

isms IChalfie. M. et al.. Science 263 . 802-805 (1994)1 FIGS. 4a and 4b illustrate the excitation and emission 

makes GFP a powerful new tool in molecular and cell a wikMypc and a g^p of mutant G FPs: 

biology and indicates that the oxidative cyclization must be 4 < . ^ „ , F ... 

either spontaneous or dependent only on ubiquitous -f? ^US™ 16 ? ? C l?* 6 « ^"°^°P* 10rc f 01 ™ 311011 m toe 

enzymes and reactants. wiliUype GFP and the Ser 65->Thr mutant; 

A major question in protein photophysics is how a single nQS 6a and 66 illustrate the behavior of wild-type GFP 
chromophore can give widely different spectra depending on and * c Scr 65 ^ Thr mutant, respectively, upon progressive 
its local protein environment This question has received the 50 i™^ 011 with ultraviolet light; and 
most attention with respect to the multiple colors of visual FIG. 7 illustrates fluorescence excitation and emission 
pigments based on retinal IMerbs. S. L. & Nathans. J. spectra of a third group of GFP mutants. 
Science 258. 46-466 (1992)|. but is also important in GFP. nF^nmm hp thf 
The GFP from Aequorea and that of the sea pansy Renilla DETAILED DESCRIPTION OF THE 
reniformis share the same chromophore. yet Aequorea GFP 55 INVENTION 
has two absorbance peaks at 395 and 475 nm, whereas GFP was expressed in E. colt under the control of a T7 
Renilla GFP has only a single absorbance peak at 498 nm, promoter for quantitative analysis of the properties of the 
with about 5.5 fold greater monomer extinction coefficient recombinant protein. Gel electrophoresis under denaturing 
than the major 395 nm peak of the Aequorea protein (Ward. conditions showed protein of the expected molecular weight 
W. W. in Bioluminescence and Chemiluminescencc (eds. 60 (27 kDa) as a dominant band (FIG. 1). which could be 
DeLuca. M.A. & McElroy. W. D.) 235-242 (Academic quantified simply by densitometry of staining with Coo- 
Press. New York. 1981)1. The spectra of the isolated chro* massie blue. Soluble recombinant GFP proved to have 
mophore and denatured protein at neutral pH do not match identical spectra and the same or even slightly more fluo- 
the spectra of either native protein ICody. C. W. et al., rescence per mole of protein as GFP purified from Aequorea 
Biochemistry 32. 1212-1218 (1993)). 65 victoria, showing that the soluble protein in E. coli under- 

For many practical applications, the spectrum of Renilla goes correct folding and oxidative cyclization with as high 

GFP would be preferable to that of Aequorea. because an efficiency as in the jellyfish. 



5.777.079 



The bacteria also contained inclusion bodies consisting of 
protein indistinguishable from jellyfish or soluble recombi- 
nant protein on denaturing gels (FIG. 1). However, this 
material was completely non-fluorescent, lacked the visible 
absorbance bands of the chrornophore. and could not be 
made fluorescent even when solubilized and subjected to 
protocols that re nature GFP I Ward. W. W. & Bokman. S. H., 
Biochemistry 21. 4535-4540 (1982); Surpin. M. A. & Ward 
W. W.. Photochem. Photobioi. 49. Abstract, 25S (1989)1. 
Therefore, protein from inclusion bodies seemed perma- 
nently unable to generate the internal chromophore. An 
interesting intermediate stage in protein maturation could be 
generated by growing the bacteria anaerobicaily.The soluble 
protein again looked the same as GFP on denaturing gels 
(FIG. 1) but was non-fluorescent. In this case, fluorescence 
gradually developed after admission of air. even when fresh 
protein synthesis was blocked using puromycin and tetra- 
cycline. Evidently, the soluble non-fluorescent protein syn- 
thesized under anaerobic conditions was ready to become 
fluorescent once atmospheric oxygen was readmitted. The 
fluorescence per protein molecule approached its final 
asymptotic value with a single-exponential time course and 
a rate constant of 0.24+0.06 hT 1 (at 22° C) measured either 
in intact cells with protein-synthesis inhibitors or in a lysate 
in which the soluble proteins and cof actors were a thousand 
fold more dilute. Such pseudo-first order kinetics strongly 
suggest that no enzymes or cofactors are necessary for the 
final step of fluorophore formation in GFP. 

It has thus been determined that formation of the final 
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rated. They all lay in the C terminal part of the protein (Table 
I), remote in primary sequence from the chromophore 
formed from residues 65-67. 

These and other modifications are defined herein with 
reference to the amino acid sequence |SEQ ID NO:2| 
encoded by the reported cDNA (SEQ ID NO:l|: the first 
amino acid identified is the one found at the indicated 
Location in the reported sequence, while the second indicates 
the substitution found in the modified form. The fluorescent 
product derived from a wild-type or modified GFP polypep- 
tide sequence is no longer strictly speaking a simple 
polypeptide after oxidation and cyclization; however, refer- 
ence is sometimes made for sake of simplicity herein to the 
polypeptide (e.g.. "wild-type GFP" or "modified GFF*) 
where what is intended would be obvious from the context. 
Compared with wild-type GFP. H9 (Ser 202-»Phe, Thr 
203->Ile) had increased fluorescence at 395 nm excitation; 
P9 (lie 167-»Val) and Pll (lie 167->Thr) were more fluo- 
rescent at 475 nm excitation. 

One possibility for these spectral perturbations in P9 and 
Pll is that the mutations at He 167 shift a positive charge 
slightly closer to the phenolic group of the fluorophore; this 
should both increase the percentage of phenolic anion, 
which is probably the species responsible for the 470-475 
nm excitation peak, and shift the emission peak hypsochro- 
mically. However, the hypothesized ionizable phenolic 
group would have to be buried inside the protein at normal 



pH. because the ratio of 471 to 396 nm peaks in the mutants 

0 . . _ . „^^a c • could not be further affected by external pH until it was 

fluorophore requires molecular oxygen and V™f ™ 3, ^ d * natoaUon/m 

wild-type protein with a time constant of -4 h at 22° C. and _ h _ uu ^ ^ „, :1/I _ ^in> *i„.t„ ivx/w w w « 
atmospheric p0 2 . This was independent of dilution, imply- 
ing that the oxidation does not require enzymes or cofactors. 

A molecular interpretation is presented in FIG. 2. If the 
newly translated apoprotein (top left) evades precipitation 35 
into inclusion bodies, the amino group of Gly 67 might 
cyclize onto the carbonyl group of Ser 65 to form an 
imidazolidin-5-one. where the process would stop (top 
center) if 0 2 is absent. The new N=C double bond would be 
expected to promote dehydrogenation to form a conjugated 40 
chromophore; irnidazolidin-5-ones are indeed known to 
undergo autoxidative formation of double bonds at the 
4-position |Kjaer. A. Acta Chem. Scand. 7, 1030-1035 
(1953); Kidwai. A. R. & Devasia. G. M. J. Org, Chem, 27. 
4527-4531 (1962>U which is exactly what is necessary to 45 
complete the fluorophore (upper right). The protonated and 
deprotonated species (upper and lower right) may be respon- 
sible for the 395 and 470-475 nm excitauon peaks, respec- 
tively. The excited states of phenols are much more acidic 
than their ground states, so that emission would come only 50 
from a deprotonated species. 

The Aequorea GFP cDNA was subjected to random 
mutagenesis by hydroxylarnine treatment or polymerase 
chain reaction. Approximately six thousand bacterial colo- 
nies on agar plates were uUuminated with alternating 395 and 55 
475 nm excitation and visually screened for altered excita- 
tion properties or emission colors. 

According to a first aspect of the present invention, 
modifications are provided which result in a shift in the ratio t 
of the two excitations peaks of the product after oxidation 60 *oMe mutant Y66H, Y145F was identified which ha< 
and cyclization relative to the wild type. Three mutants were almost the same wavelengths as the single mutant Y66H bu 
found with significant alterations in the ratio of the two main almost twice the brightness, due mainly to a higher quantun 
excitauon peaks (Table I). The mutations were sequenced efficiency of fluorescence. The double mutant also devel 
and recombined with the wild-type gene in different ways to oped its fluorescence during overnight growth, whereas th* 
eliminate neutral mutations and assign the fluorescence 65 single mutant required several days, 
effects to single amino acid substitutions, except for H9 In accordance with further embodiments of this aspect 0 
where two neighboring mutations have not yet been sepa- the invention, a first round of mutagenesis to increase th» 



pH- sensitivity of wild-type GFP is similar | Ward. W. W. et 
al.. Photochem, Photobioi 35. 803-808 (1982) |. 

According to another aspect of the invention, a mutant P4 
(Tyj 66— > His) was identified which was excitable by ultra- 
violet and fluoresced bright blue in contrast to the green ol 
wild type protein. The excitation and emission maxima were 
hypsochromically shifted by 14 and 60 nm respectively from 
those of wild-type GFP. The mutated DNA was sequenced 
and found to contain five amino acid substitutions, only one 
of which proved to be critical: replacement of TVr 66 in the 
center of the chromophore by His (corresponding to a 
change in the GFP cDNA sequence |SEQ ID NO:l| al 
196-198 from TAT to CAT). 

The surprising tolerance for substitution at this key resi- 
due prompted further site-directed mutagenesis to Trp an<! 
Phe at this position. Trp gave excitation and emissior 
wavelengths intermediate between iyr and His (Table I) bui 
was only weakly fluorescent, perhaps due to inefficiency ol 
folding or chromophore formation due to steric consider 
auons. Phe gave weak fluorescence with an excitation maxi 
mum at 358 nm and an emission maximum at 442 nm 
Accordingly, pursuant to this aspect of the invention modi 
fied GFP proteins which fluoresce at different wavelength: 
(preferably, different by at least 10 nm and more preferably 
by at least 50 nm) relative to the native protein are provided 
for example, those wherein Tyr 66 is replaced by Phe. Hi 
or Trp. 

In a further embodiment of this aspect of the invention. 
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brightness of Y66W yielded M153T/V163A/N212K as ously crucial. Che locations of known neutral mutations 

additional substitutions. This mutant was subjected to suggest that amino acids 76-115 are less critical to the 

another round of mutagenesis, resulting in two further sets, spectroscopic properties of the product. In addition, as 

N146I and I123V/Y145H/H148R (Table II). The quantum would be immediately apparent to those working in the field. 

efficieDcy of these mutants is now comparable to wild-type 3 the use of various types of fusion sequences which lengthen 

GFP. The clustering of the substitutions in residues 145 to the resultant protein and serve some functional purpose in 

163 suggest that those residues lie relatively close to the the preparation or purification of the protein would also be 

chromophore and that reductions in the size of their side routine and are contemplated as within the scope of the 

chains might be compensating for the larger size of tryp- present invention. For example, it is common practice to add 

tophan compared to tyrosine. 10 amino acid sequences including a polyhistidine tag to facili- 

Puxsuant to yet another aspect of the present invention. tate purification of the product proteins. As such fusions do 

modified GFP proteins are provided which provide substan- not significantly alter the salient properties of the molecules 

tially more intense fluorescence per molecule than the wild comprising same, modified GFPs as described herein includ- 

type protein. Modifications at Ser 65 to Ala. leu. Cys. Val, ing such fusion sequences at either end thereof are also 

lie or Thr provide proteins with red- shifted and brighter !5 clearly contemplated as within the scope of the present 

spectra relative to the native protein. In particular, the Thr invention. 

mutant (corresponding to a change in the GFP cDNA Similarly, in addition to the specific mutations disclosed 

sequence |SEQ ID NO:l| at 193-195 from TCT to ACT) herein, it is well understood by those working in the field 

and Cys mutant (corresponding to a change in the GFP that in many instances modifications in particular locations 

cDNA sequence |SEQ ID NO:l| at 193-195 from TCT to jo in the polypeptide sequence may have no effect upon the 

TGT) are about six times brighter than wild type when properties of the resultant polypeptide. Unlike the specific 

excited at the preferred long-wavelength band above 450 mutations described in detail herein, other mutations provide 

nm As a consequence, these modified proteins are superior polypeptides which have properties essentially or substan- 

to wild type proteins for practically all applications. Further. tially indistinguishable from those of the specific polypep- 

the brightness of these modified proteins matches the bright- 2 $ tides disclosed herein. For example, the following substitu- 

ness reported in the literature for Renilla GFP; thus, these tions have been found to be neutral (i.e., have no significant 

proteins clearly obviate the objections to the dimness of impact on the properties of the product): Lys 3-»Arg; Asp 

Aequorea GFP. In fact, it is speculated that the chro- 76-*Gly; Phe 99->De; Asn 105->Ser; Glu 115-»Val; Thr 

raophores in these modified proteins may exhibit the opti- 225-*Ser; and Lys 238-K3lu. These equivalent polypep- 

mum brightness which could be achieved with a general 30 tides (and oligonucleotide sequences encoding these 

structure derived from the Aequorea GFP chromophore. In polypeptides) are also regarded as within the scope of the 

particular, these mutations provide products exhibiting one present invention. In general, the polypeptides and oligo- 

or more of the following salient characteristics which dis- nucleotide sequences of the present invention (in addition to 

tinguish them clearly over the corresponding product from a containing at least one of the specific mutations identified 

wild-type GFP: reduced efficiency of excitation by wave- 35 herein) will be at least about 85 % homologous, more 

lengths between about 350 and 420 nm; enhanced excitation preferably at least about 90% homologous, and most pref- 

and emission efficiency when excited with wavelengths erably at least about 95% homologous, to the wild-type GFP 

longer than about 450 nm; increased resistance to light- described herein. Because of the significant difference in 

induced shifts in the excitation spectrum; and faster kinetics properties observed upon introduction of the specified modi- 

of fluorophore generation. In contrast, mutations toTrp. Arg, ^ fications into a GFP sequence, the presence of the specified 

Asn, Phe and Asp did not provide improved brightness. modifications relative to the corresponding reported 

Mutagenesis of S65T to shift its wavelengths further to sequence for wild-type GFP |SEQ ID NO:2| are regarded as 

the red yielded M 153A/K238E (Table ID as the GFP variant central to the invention. 

with the longest-wavelength excitation maximum yet The oligonucleotide sequences of the present invention 
described. 504 nm vs. 490 nm for S65T. Surprisingly, the 45 are particularly useful in processes for labelling polypep- 
emission peak hardly changed (514 nm vs. 5 11 nm), so mat tides of interest, e.g., by the construction of genes encoding 
the separation between the excitation and emission peaks fluorescent fusion proteins. Fluorescence labeling via gene 
(Stokes* shift) is extremely narrow, only 10 nm. This is one fusion is site-specific and eliminates the present need to 
of the smallest values reported for any fluorophore in purify and label proteins in vitro and microinject them into 
aqueous solution at room temperature. As in the Y66W 50 cells. Sequences encoding the modified GFPs of the present 
series. Ml 53 seems to be influential. It is doubtful that invention may be used for a wide variety of purposes as are 
K238E is important, because this substitution has been well known to those working in the field. For example, the 
found to be without effect in other mutants. sequences may be employed as reporter genes for monitor- 
As would be readily apparent to those working in the ing the expression of the sequence fused thereto; unlike 
field, to provide the desired fluorescent protein it would not 55 other reporter genes, the sequences require neither substrates 
be necessary to include the entire sequence of GFP. In nor cell disruption to evaluate whether expression has be 
particular, minor deletions at either end of the protein achieved. Similarly, the sequences of the present invention 
sequence are expected to have little or no impact on the may be used as a means to trace lineage of a gene fused 
fluorescence spectrum of the protein. Therefore, by a mutant thereto during the development of a cell or organism, 
or wild-type GFP sequence for purposes of the present 60 Further, the sequences of the present invention may be used 
invention are contemplated not only the complete polypep- as a genetic marker; cells or organisms labeled in this 
tide and oligonucleotide sequences discussed herein, but manner can be selected by, e.g.. fluorescence- activated cell 
also functionally-equivalent portions thereof (i.e.. portions sorting. The sequences of the present invention may also be 
of the polypeptide sequences which exhibit the desired used as a fluorescent tag to monitor protein expression in 
fluorescence properties and oligonucleotide sequences 65 vivo, or to encode donors or acceptors for fluorescence 
encoding these polypeptide sequences). For example. resonance energy transfer. Other uses for the sequences of 
whereas the chromophore itself (position 65-67) is obvi- the present invention would be readily apparent to those 
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working in the field, as would appropriate techniques fa- 
fusing a gene of interest to an oligonucleotide sequence of 
the present invention in the proper reading frame and in a 
suitable expression vector so as to achieve expression of the 
combined sequence. 5 

The availability of several forms of GFP with such 
different spectral properties should facilitate two-color 
assessment of differential gene expression, developmental 
fate, or protein trafficking. For example, if one wanted to 
screen for a drug that is specific to activate expression of io 
gene A but not gene B. one could fuse the cDNA for one 
color of GFP to the promoter region of gene A and fuse the 
cDNA for another color to the promoter region of gene B. 
Both constructs would be transfected into target cells and the 
candidate drugs could be assayed to detainine if they 15 
stimulate fluorescence of the desired color, but not fluores- 
cence of the unde sired color. Similarly, one could test for the 
simultaneous expression of both A and B by searching for 
the presence of both colors simultaneously. 

As another example, to examine the precise temporal or 20 
spatial relationship between the generation or location of 
recombinant proteins X and Y within a cell or an organism, 
one could fuse genes for different colors of GFP to the genes 
respectively. If desired. DNA 



Bamrfl sites at the 5'and 3'ends. respectively, and was 
cloned behind the T7 promoter of pGEMEX2 (Promega) 
replacing most of the T7 gene 10. The resulting plasmid was 
transformed into the strain JM109(Dat) (Promega Corp.. 
Madison. Wis.), and high level expression was achieved by 
growing the cultures at 24° C. to saturation without induc- 
tion by IFTG. To prepare soluble extracts. 1.5 ml cell 
suspension were collected, washed and resu speeded in 150 
ul 50 mM Tris/HCI. pH 8.0. 2 mM EDTA. Lysozyme and 
DNAse I were added to 0.2 mg/rol and 20 ug/ml. 
respectively, and the samples were incubated on ice until 
lysis occurred ( 1-2 hours). The lysates were then clarified by 
centrifuging at 12.000*g for 15 minutes. Inclusion bodies 
were obtained as described in the literature ISambrook. J. et 
al. in Molecular Cloning: A Laboratory Manual Vol. 2. 
17.37-17.41 (Cold Spring Harbor Press. Cold Spring 
Harbor. New York. 1989) |. 

As illustrated in FIG. 1. soluble extracts of E. colt 
expressing GFP show a predominant band which is absent in 
extracts from control cells and has the same electrophoretic 
mobility as native GFP isolated from the jellyfish A. Victo- 
ria. Inclusion bodies of expressing cells consist mainly of 
non-fluorescent GFP which has the same mobility as soluble 
GFP. Non-fluorescent soluble GFP of anaerobically grown 



for proteins X and Y. ^ ^ _ 

sequences encoding flexible oligopeptide spacers could be 25 cu | lurcs j s ai s0 a major band with correct mobility. Soluble 

included to allow the linked domains to function autono- cxtracts of m e mutated clones H9. P9. Pll and P4 again 

mously in a single construct By examining the appearance conta i n a dominant protein with essentially the same 

of the two distinguishable colors of fluorescence in the very mo lecular weight, 
same cells or organisms, one could compare and contrast the 

generation or location of the proteins X and Y with much 30 R^dom mutagenesis of the GFP cDNA was done by 

Later vision and less biological variability than if one increasing the error rate of the po^erase .chain reaction 



greater precision and less biological variability 
had to compare two separate sets of cells or organisms, each 
containing just one color of GFP fused to either protein X or 
Y. Other examples of the usefulness of two colors would be 
obvious to those skilled in the art. 

The further mutations to brighten the Y66H and Y66W 
variants of GFP enhance the possibility of using two or three 
colors of fluorescent protein to track differential gene 
expression, protein localizations or cell fates. For example. 



35 



with 0.1 mM MnCl 2 . 50 uM dATP and 200 uM of dGTP. 
dCTP. and dTTP IMuhlrad, D. et al.. Yeast 8. 79-82 ( 1992) |. 
The product was ligated into pGEMEX2 and subsequently 
transformed into JM109(DE3). Colonies on agar were visu- 
ally screened for different emission colors and ratios of 
brightness when excited at 475 vs. 395 nm 

FIGS. 3a and 3b illustrate the excitation and emission 
spectra of wild-type and mutant GFPs. In FIGS. 3a and 3b. 



mutants P4-3 (Y66H/Y145F). W7 (Y66W/N146T/M153T/ 40 .. wild-type; - - S202F. T203I; - - - I167T; Y66W: 

V163A/N212K) and S65T can all be distinguished from -#- •Y66H. Samples were soluble fractions from E. coli 

each other. P4-3 is specifically detected by exciting at expressing the proteins at high level, except for Y66W. 

290-370 nm and collecting emission at 420-460 nm. W7 is which was obtained in very low yield and measured on intacl 

specifically detected by exciting at 41(W57 ran and collect- cells. Autofluorescence was negligible for all spectra excepi 

ing emission at 465^95 nm. S65T is specifically detected 45 those of Y66W. whose excitation spectrum below 380 nrc 

by exciting at 483-493 nm and collecting emission al may be contaminated by autofluorescence. Excitation and 

wavelengths greater than 510 nm. Bacteria carrying these emission spectra were measured with 1.8 nm bandwidth; 

three proteins are readily discriminated under a microscope and the non-scanning wavelength set to the appropriate 

using the above wavelength bandpass filters. peak. Excitation spectra were corrected with a rhodamine E 

The chromophore in GFP is well buried inside the rest of 50 quantum counter, while emission spectra (except for Y66W; 



the protein, so much of the dimness of the original point 
mutants was presumably due to steric mismatch between the 
substituted amino acid and the cavity optimized for tyrosine. 
The location of the beneficial mutations implies that residues 
145-163 are probably close to the chromophore. The 53 
M153A/S65T mutant has the longest wavelengths and 
smallest Stokes* shift of any known fluorescent protein that 
does not use a cofactor. 

The invention may be better understood with reference to 
the accompanying examples, which are intended for pur- 
poses of illustration only and should not be construed as in 
any sense limiting the scope of the invention as defined by 
the claims appended hereto. 

EXAMPLE 1 

The coding region of GFP clone 10.1 IPrasher et al. 
(1992). supra! was amplified by PCR to create Ndel and 



were corrected for monochromator and detector efficiencie; 
using manufacturer-supplied correction spectra. All ampli- 
tudes have been arbitrarily normalized to a maximum value 
of 1.0. A comparison of brightness at equal protein concen- 
trations is provided in Table I. 

TABLE I 

Characteristics of nmUtcd vs. wild-type GFP 



60 



65 







Excitation 


Emission 


Relative* 


Variant 


Mutation 


Maxima (nm)* 


Maxima (nm) b 


Fluorescence 


Wild 


none 


396 (476) 


508(503) 


(sl00%) 


type 










H9 


Ser 202-KPbc. 


398 


5U 


1179b" 




Thr 203-+rjc 








P9 


Ue l67->Vul 


471 (396) 


502 (507) 


166%* 
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TABLE I-continued 





Characteristics of mutated vs 


. wild-tvDc GFP 








Excitation 


Emission 


Relative' 


Variant 


Mutation 


Maxima (nm)" 


Maxima (nm) b 


Fluorescence 


Pll 


lie !67->Thr 


471 (396) 


502(507) 


m%* 


P4 


TVr66->His 


382 


448 


57%' 


W 


Tyr 66->Trp 


458 


480 


iLd. 



5 



"Values in parentheses are lower-amplitude peaks. 

b Primary values were observed when exciting at the main excitation peak; 
values in parentheses were observed when illuminating at the lower- 
amplitude excitation peak. 

'Equal amounts of protein were used based on densitometry of gels stained 
with Coomassie Blue (Fig. 1). 

''Emission maxima of spectra recorded at excitation 395 am were compared 15 
"Emission maxima of spectra recorded at excitation 475 am were compared. 
Emission spectrum of P4 recorded at 378 nm excitation was integrated and 
compared to the integrated emission spectrum of wild type recorded at 475 
nm excitation; both excitation and emission characteristics were corrected. 

EXAMPLE 2 20 
Oligonucleotide-directed mutagenesis at the codon for 
Ser-65 of GFP cDNA was performed by the literature 
method | Kunkel.T. A. (1985) Pmc. Natl. Acad. ScL USA 82. 
488| using the Muta-Gene Phagemid in Vitro Mutagenesis 2 5 
Kit version 2. commercially available from Bio-Rad. 
Richmond. Calif. The method employs a bacterial host strain 
deficient for dUTPase (dut) and uracil-N-glycosylase (ung). 
which results in an occasional substitution of uracil for 
thymine in newly-synthesized DNA. When the uracil- 30 
containing DNA is used as a wild-type template for 
oligonucleotide-directed in vitro mutagenesis, the comple- 
mentary (mutant) strand can be synthesized in the presence 
of deoxy nucleotides, ligase and polymerase using the 
mutagenic oligonucleotide to prime DNA synthesis; the 35 
Version 2 kit utilizes unmodified T7 DNA polymerase to 
synthesize the complementary strand. When the heterodu- 
plex molecule is transformed into a host with an active 
uracil -N-gly co sylase (which cleaves the bond between the 
uracil base and the ribose molecule, yielding an apyrimidic 40 
site), the uracil-oontaining wild-type strand is inactivated, 
resulting in an enrichment of the mutant strand. 

The coding region of GFP cDNA was cloned into the 
BamHI site of the phagemid pRSET^ from Invitrogeo (San 
Diego. Calif.). This construct was introduced into the dut 45 
ung double mutant E. coli strain CJ236 provided with the 
Muta-Gene kit and superinfected with helper phage 
VCSM 13 (Stratagene, La Jolla, Calif.) to produce phagemid 
particles with single-stranded DNA containing some uracils 
in place of thymine. The uracii-containing DNA was purified 50 
to serve as templates for in vitro synthesis of the second 
strands using the mutagenic nucleotides as primers. The 
DNA hybrids were transformed into the strain XL 1 blue 
(available from Stratagene), which has a functional uracil- 
N-glycosylase; this enzyme inactivates the parent wild-type 55 
DNA strand and selects for mutant clones. DNA of several 
colonies were isolated and checked for proper mutation by 
sequencing. 

To express the mutant proteins, the DNA constructs 
obtained by mutagenesis were transformed into E. coli strain 60 
BL21(DE3)LysS (Novagen. Madison, Wis.), which has a 
chromosomal copy of T7 polymerase to drive expression 
from the strong T7 promoter. At room temperature 3 ml 
cultures were grown to saturation (typically, overnight) 
without induction. Cells from 1 ml of culture were collected. 65 
washed and finally resuspended in 100 ul of 50 mM Tris pH 
8.0. 300 mM NaCl. The cells were then lysed by three cycles 



10 

of freeze/thawing (liquid nitrogen/30 o C. water bath). The 
soluble fraction was obtained by pelletting cell debris and 
unbroken cells in a nucrofuge. 

To facilitate purification of the recombinant proteins, the 
vector used fuses a histidine tag (6 consecutive His) to the 
N-terminus of the expressed proteins. The strong interaction 
between histidine hexamers and Ni 2+ ions permitted purifi- 
cation of the proteins by NI-NTA resin (available commer- 
cially from Qiagen. Chatsworth. Calif.). Microcolumns (10 
ul bed volume) were loaded with 100 soluble extract (in 
50 mM Tris pH 8.0. 300 mM NaCl). washed with 10 bed 
volumes of the same buffer and with 10 volumes of the 
buffer containing 20 mM imidazole. The recombinant pro- 
teins were then eluted with the same buffer containing 100 
mM imidazole. 

Aliquots of the purified mutant GFP proteins were run 
along with wild-type GFP on a denaturing polyacrylamide 
gel. The gel was stained with Coomassie blue and the protein 
bands were quantified by scanning on a densitometer. Based 
on these results, equal amounts of each version of protein 
were used to run fluorescence emission and excitation 
spectra. 

FIGS. 4a and 4b compare the excitation and emission 
spectra of wild-type and Ser 65 mutants. In FIG. 4a. -S65T; 
- - S65A; - - - S65C; wild-type (emission at 508 nm). 
In FIG. 4B, - S65T; - - S65A; - - - S65C; ••• wild-type 
(excitation at 395 mn); wild-type (excitation at 475 
nm). Excitation and emission spectra were measured with 
1.8 nm bandwidlhs and the non-scanning wavelength set to 
the appropriate peak. As is apparent from FIG. 4b. all three 
mutants exhibited substantially higher intensity of emission 
relative to the wild-type protein. 

FIG. 5 illustrates the rates of fluorophore formation in 
wild-type GFP and in the Ser 65->Thr mutant E. coli 
expressing either wild-type or mutant GFP were grown 
anaerobically. At time=0. each sample was exposed to air: 
further growth and protein synthesis were prevented by 
transferring the cells to nutrient-free medium also containing 
sodium azide as a metabolic inhibitor. Fluorescence was 
subsequently monitored as a function of time. For each 
culture, the fluorescence intensities are expressed as a frac- 
tion of the final fluorescence intensity obtained at t=18 to 20 
hours, after oxidation had proceeded to completion. From 
FIG. 5. it is apparent that development of fluorescence 
proceeds much more quickly in the mutant than in wild-type 
GFP. even after normalization of the absolute brightnesses 
(FIGS. 4a and 4b). Therefore, when the development of GFP 
fluorescence is used as an assay for promoter activation and 
gene expression, the mutant clearly gives a more rapid and 
faithful measure than wild-type protein. 

FIGS. 6a and 6b illustrate the behavior of wild-type GFP 
and the Ser 65-»Thr mutant respectively, upon progressive 
irradiation with ultraviolet light Numbers indicate minutes 
of exposure to illumination at 280 nm; intensity was the 
same for both samples. Wild-type GFP (FIG. 6a) suffered 
photoisomerization. as shown by a major change in the 
shape of the excitation spectrum. Illumination with broad 
band (240-400 nm) UV caused qualitatively similar behav- 
ior but with less increase of amplitude in the 430-500 nm 
region of the spectrum. The photoisomerization was not 
reversible upon standing in the dark. This photoisomeriza- 
tion would clearly be undesirable for most uses of wild-type 
GFP. because the protein rapidly loses brightness when 
excited at its main peak near 395 nm. The mutant (FIG. 6b) 
showed no such photoisomerization or spectral shift. 

EXAMPLE 3 

GFP cDNAs encoding for Tyr66-*His (Y66H). 
Tyr66->Trp (Y66W). or Ser65->Thr (S65T) were separately 
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further mutagenized by the polymerase chain reaction and 
transformed into £ coli for visual screening of colonies with 
unusual intensities or colors. Isolation, spectral character- 
ization (Table U and FIG. 7). and DNA sequencing yielded 
several additional useful variants. 

Random mutagenesis of the gfp cDNA was done by 
increasing the error rate of the PCR with 0.1 mM MnQ 2 and 
unbalanced nucleotide concentrations. The GFP mutants 
S65T. Y66H and Y66W had been cloned into the BamHl 
site of the expression vector pRSETB (Invitrogen), which 
includes a T7 promoter and a polyhistidine tag. The GFP 
coding region (shown in bold) was flanked by the following 
5' and 3' sequences: 5-G GAT CCC CCC GCT GAA TTC 
ATG . . . AAATAATAA GGATCC-3'. The 5* primer for the 
mutagenic PCR was the T7 primer matching the v ector 
sequence; the 3* primer was 5'-GGT AAG CTT TTA TIT 
GTA TAG TTC ATC CAT GCC-3'. specific for the 3' end of 
GFP. creating a HindHI restriction site next to the stop 
codon. Amplification was over 25 cycles (1 min at 94° C, 
1 min 52° C. 1 min 72° C) using the AmphTaq polymerase 
from Perkin Elmer. Four separate reactions were run in 
which the concentration of a different nucleotide was low- 
ered from 200 uM to 50 uM. The PCR products were 
combined, digested with BamHI and Hindm and ligated to 
the pRSETB cut with BamHI and HindHI. The ligation 
mixture was dlalyzed against water, dried and subsequently 
transformed into the bacterial strain BL21(DE3) by elec- 
troporation (50 pi electrocorapetent cells in 0.1 cm cuvettes, 
1900 V, 200 ohm, 25 uF). Colonies on agar were visually 
screened for brightness as previously described herein. The 
selected clones were sequenced with the Sequenase version 
2.0 kit from Unites States Biochemical. 

Cultures with freshly transformed cells were grown at 37° 
C. to an optical density of 0.8 at 600 nm. then induced with 
0.4 mM isopropylthiogalactoside overnight at room tem- 
perature. Cells were washed in PBS pH 7.4. resuspended in 
50 mM Tris pH 8.0. 300 mM NaCl and iysed in a French 
press. The polyhistidine-tagged GFP proteins were purified 
from cleared lysates on nickel-chelate columns (Qiagen) 
using 100 mM imidazole in the above buffer to elute the 
protein. 

Excitation spectra were obtained by collecting emission at 
the respective peak wavelengths and were corrected by a 
Rhodamine B quantum counter. Emission spectra were 
likewise measured at the respective excitation peaks and 
were corrected using factors from the fluorometer manufac- 
turer (Spex Industries. Edison. N.J.). In cleavage experi- 



ments emission spectra were recorded at excitation 368 nm. 
For measuring molar extinction coefficients. 20 to 30 ug of 
protein were used in 1 ml of PBS pH 7.4. Quantum yields 
of wild-type GFP. S65T. and P4-1 mutants were estimated 
by comparison with fluorescein in 0.1 N NaOH as a standard 
of quantum yield 0.91 led. Miller. J. N.. Standards in 
Fluorescence Spectrometry (Chapman and Hall. New York. 
198 1)|. Mutants P4 and P4-3 were likewise compared to 
9-amino-acridine in water (quantum yield 0.98). W2 and W7 
were compared to both standards, which fortunately gave 
concordant results. 



FIG. 7 illustrates the fluorescence excitation and emission 
spectra of different GFP mutants. All spectra were normal- 
ized to a maximal value of 1. Each pair of excitation and 
emission spectrum is depicted by a distinct line style. 
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The fluorescence properties of the obtained GFP mutants 
are reported in Table D. 

TABLE H 

Fluorescence properties of GFP mutants 
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45 







Excitation 


Emission 


Extinct. Coeff. 


Quantum 


Clone 


Mutations 


max (nm) 


max (nm) 


(MT'cm -1 ) 


yield 


P4-3 


Y66H 


381 


445 


14,000 


0.38 




Y145F 










W7 


Y66W 


433 (453) 


475 (501) 


18,000 (17,100) 


0.67 




N146I 












M153T 












V163A 












N212K 










W2 


Y66W 


432 (453) 


480 


10,000 (9,600) 


0.72 




11 23V 












Y145H 












H148R 












M153T 












V163A 












N212K 










P4-1 


S65T 


504 (396) 


514 


14,500 (8.600) 


0.54 




M153A 












K238E 











SEQUENCE USTINO 



( I ) GENERAL INFORMATION: 

( i i k ) NUMBER OF SEQUENCES: 2 



{ 2 ) INFORMATION FOR SEQ ID NO.l : 

( i > SEQUENCE CHARACTERISTICS: 
( A > LENGTH: 716 hwe pan 
i B > TYPE: nucleic acid 
( C ) STRANDEDNESS: tingle 
( D )TOPOLOOY: linear 



( i i ) MOLECULE TYPE: cONA 



I i x ) FEATURE: 

( A ) NAME/KEY: CDS 
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< B ) LOCATION: 1. 7 16 

I x i ) SEQUENCE DESCRIPTION: SEQ ID NO: I: 

ATG AO T AAA OOA O A A OA A CTT TTC ACT OGA GTT OTC CCA ATT CTT OTT 

Met Ser Lyi O 1 y Glo G ! o Leu Pbe Thi Gly Vil Val Pro lie Leu Vil 
1 5 10 15 

OAA TTA OAT GOT GAT GTT A A T OGO CAC AAA TTT TCT OTC ACT GOA GAG 

Glu Leu Asp Oly Asp Val Aid Gly Hit Lyj Pbe Ser V a 1 Ser Oly Olu 
2 0 2 5 3 0 

GOT OAA GOT OAT OCA ACA TAC OGA AAA CTT ACC CTT AAA TTT ATT TOC 

Oly Olu Gly Asp All Tbr Tyr Gly Lyi Leu Tbr Leo Lys Pbe lie Cyi 
3 5 4 0 4 5 

ACT ACT OOA AAA CTA CCT GTT CCA TOG CCA ACA CTT OTC ACT ACT TTC 

Thf Tbr Oly Lyi Leo Pro Val p f0 Trp Pro Tbr Leu Val Tht Tht Pbe 
5 0 5 5 6 0 

TCT TAT OOT GTT CAA TGC TTT TCA A OA TAC CCA OAT CAT ATO AAA CAG 

Ser Tyr Oly Val Olu Cyi Pbe Ser Arg Tyr Pro Asp Hit Met Lyi Gin 
6 5 70 75 SO 

CAT G A C TTT TTC A AG ACT GCC ATO CCC OAA OOT TAT OTA CAO OAA AGA 

His Asp Pbe Pbe Lyt Ser Ala Met Pro Olu Oly Tyr Val Ola Glu Arg 
» 5 9 0 9 5 

ACT ATA TTT TTC AAA OAT GAC GOO AAC TAC A A O ACA COT OCT OAA OTC 

Tbr tic Pbe Pbe L y » Asp Asp Oly Asa Tyr Lys Tbr Arg Ala Glu Val 
10 0 10 5 110 

A A O TTT G A A OOT GAT ACC CTT OTT A A T AO A ATC OAG TTA AAA OOT ATT 

Lys Pbe Glo Oly Asp Tbr Leu Val Asa Arg lie Olu Leo Lys Gty Me 
115 12 0 12 5 

OAT TTT AAA G A A GAT OOA AAC ATT CTT OGA CAC AAA TTO G A A TAC AAC 

Asp Pbe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Olu Tyr Asa 

13 0 13 5 140 

TAT AAC TCA CAC A A T OTA TAC ATC ATO GCA GAC AAA CAA A AG A A T OGA 

Tyr Aid Ser His Atn Val Tyr lie Met Ala Asp Lys Ola Lys Asa Oly 

145 150 155 160 

ATC AAA GTT AAC TTC AAA ATT AO A CAC AAC ATT OAA OAT GGA AOC OTT 

Me Lys Val Ass Pbe Lys Me Arg His Asa Me Olu Asp Oly Ser Val 
16 5 17 0 17 5 

CAA CTA OCA GAC CAT TAT CAA CAA A A T ACT CCA ATT OOC OAT OOC CCT 

Gla Leu Ala Asp His Tyr Gin Ola Asn Thr Pro lie Oly Asp Oly Pro 
18 0 18 5 19 0 

OTC CTT TTA CCA GAC AAC CAT TAC CTO TCC ACA CAA TCT OCC CTT TCG 

Val Leu Leu Pro Asp Asn Hi* Tyr Leu Ser Thr Ola Ser Ala Leu Ser 
195 200 205 

AAA GAT CCC AAC OAA A A O AGA GAC CAC ATO OTC CTT CTT OAO TTT OTA 

Lys Asp Pro Asn Olu Lys Arg Asp His Met Val Leu Leo Olu Pbe Val 

2 10 2 15 2 2 0 

ACA GCT OCT GOG ATT ACA CAT OOC ATO GAT OAA CTA TAC AAA TA 

Tbr Ala Ala Gly Me Tbr His Oly Met Asp Olu Leu Tyr Lys 

225 230 235 

( 2 > INFORMATION FOR SEQ ID N02: 

( i ) SEQUENCE CHARACTERISTICS: 

t A ) LENGTH: 238 amino acads 
( B > TYPE: amino acid 
( D > TOPOLOGY: linear 

( i i ) MOLECULE TYPE: proteii 

(i i ) SEQUENCE DESCRIPTION: SEQ ID NOi2: 

Met Ser Lys Oly Olu Glu Leu Pbe Tbr Oly Val Val Pro lie Leu Val 
1 5 10 15 

Glo Lea Asp Oly Asp Vat Asn Oly His Lyi Pbe Ser Val Ser Oly Olu 
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2 0 










2 5 










3 0 






G i y 


a l u 


O I y 
3 3 


Asp 


A 1 a 


T b r 


Ty r 


G 1 y 
4 0 


L y s 


Leu 


T h r 


Leu 


L y i 
4 3 


Pb e 


[ 1 e 


C y s 


T b r 


T b i 

5 0 


G 1 y 


Ly» 


Leu 


P I o 


V a 1 

5 5 


P 1 o 


T r p 


P r o 


T fa i 


Leu 

6 0 


V a 1 


T b i 






Ser 
6 S 


Ty t 


O 1 y 


V a 1 


G 1 a 


C y s 

7 0 


P b c 


S e r 


At g 


T y r 


Pro 

7 5 


A s p 


Hit 




L y • 


Gin 
8 0 


H i t 


Asp 


Pb e 


P fa e 


L y « 
8 5 


Ser 


A 1 a 


Me t 


P I o 


O I u 

9 0 


G 1 y 


T y r 


V a 1 




G 1 u 

9 5 




T b r 


t 1 e 


P h e 


P h e 
1 0 0 


Ly s 


A s p 


Asp 


O 1 y 


A s 0 

1 0 5 


Tyr 


Ly s 


T h r 


A r g 


A 1 a 

1 1 0 


G 1 u 


V a I 


L y » 


P b e 


G 1 u 
1 1 5 


0 1 y 


A s p 


T b r 


Leu 


V a 1 
1 2 0 


A S D 


At « 


I 1 e 


G 1 u 


Leu 
I 2 5 


L y i 


O 1 y 


lie 


A s p 


P b c 

1 3 0 


Ly s 


O 1 u 


A s p 


0 1 y 


Asa 

1 3 5 


I 1 e 


Leu 


G 1 y 


H i s 


L y s 
1 4 0 


Leu 


G 1 o 


T y r 




T y r 
1 4 5 


A s 0 


Set 






V a 1 

1 5 0 


T v r 
i y i 


1 1 e 


M e 1 


A 1 a 


A s p 

1 5 5 


L y s 


0 1 a 


L y s 


Asa 


G 1 y 
1 6 0 


I I « 


Ly i 


V a 1 


Asa 


P b e 
1 6 5 


Ly s 


) 1 C 


A r 8 


H i s 


A ■ a 

1 7 0 


I 1 e 


G 1 u 


A * p 


G 1 y 


S e I 
I 7 5 


Va 1 


G 1 a 


Leu 


A 1 a 


A i p 
1 ft 0 


H i s 


Ty r 


a 1 a 


O I B 


A i a 

I 8 5 


T h r 


Pro 


1 1 e 


G 1 y 


A s p 

1 9 0 


G 1 y 


Pro 


Va 1 


Lea 


Leu 
1 9 5 


P r o 


A * p 


A i a 


H i s 


Ty r 

2 0 0 


Leu 


S e r 


T b r 


G 1 n 


S e i 

2 0 5 


A 1 a 


Leu 


S e r 


Ly i 


A i p 

2 1 0 


P r o 


A t a 


G 1 u 


Ly * 


Al g 

2 1 5 


A s p 


H i s 


Me t 


V a 1 


Leu 

2 2 0 


Leu 


G 1 u 


P b e 


Va I 


T b r 

2 2 5 


A 1 a 


A I a 


a 1 y 


I 1 c 


T b r 

2 3 0 


H i s 


G 1 y 


Me t 


A • p 


O 1 u 

2 3 3 


Leu 


Ty r 


Ly s 







What is claimed is: 

1. A composition of matters comprising: 

a fluorescent modified form of an Aequorea wild-type ^ 
GPP polypeptide. 

characterized in that upon oxidation and cyclization of 
amino acid residues in said fluorescent modified form 
corresponding to positions 65 to 67 of wild-type GFP 
polypeptide sequence (SEQ ID NO:2) said fluorescent ^ 
modified form exhibits a different excitation and/or 
emission spectrum from a corresponding product of 
said wild-type GFP polypeptide sequence. 

with the proviso that when said fluorescent modified form 
comprises a mutation at S65. said mutation at S65 is x 
selected from the group consisting of S65A, S65C, 
S65T. S65L, S65V. and S65I. 

2. The composition according to claim 1. 

wherein said fluorescent modified form exhibits and alter- 
ation in the ration of two main excitation peaks relative 55 
to said wildrtype GFP polypeptide sequence. 

3. The composition according to claim 2. 

wherein said fluorescent modified form exhibits increased 
fluorescence at a shorter-wavelength peak of the two 
main excitation peaks than said wild-type GFP 60 
polypeptide sequence. 

4. The composition according to claim 3, 

wherein said fluorescent modified form comprises a 
replacement of Ser at a position corresponding to 
position 202 in said wild-type GFP polypeptide 65 
sequence by Phe and a replacement of Thr at a position 
corresponding to position 203 by Be. 



5. The composition according to claim 2. 

wherein said fluorescent modified form exhibits increased 
fluorescence at a longer-wavelength peak of the two 
main excitation peaks than said wild-type GFF 
polypeptide sequence. 

6. The composition according to claim 5. 

wherein said fluorescent modified form comprises a 
replacement of lie at a position corresponding to posi- 
tion 167 of said wild-type GFP polypeptide sequence 
by Val or Thr. 

7. The composition according to claim 5, 

wherein said fluorescent modified form comprises t 
replacement of Ser at a position corresponding tc 
position 65 of said wild-type GFP sequence by Thr. i 
replacement of Met at position 153 with Ala. and i 
replacement of Lys at position 238 with Glu. 

8. The composition according to claim 1. 

wherein said fluorescent modified form fluoresces at t 
shorter wavelength than said wild-type GFP polypep 
tide sequence. 

9. Hie composition according to claim 8. 

wherein said fluorescent modified form comprises i 
replacement of Tyr at a position corresponding u 
position 66 of said wild-type GFP polypeptidi 
sequence by Phe. His or Trp. 
The composition according to claim 8. 

wherein said fluorescent modified form comprises ; 
replacement of Tyr at a position corresponding t< 
position 66 of said wild-type GFP polypeptidi 
sequence by His and a replacement of Tyr at positioi 
145 with Phe. 
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11. The composition according to claim 8. 

wherein said fluorescent modified form comprises a 
replacement of iyr at a position corresponding to 
position 66 of said wild-type GFP polypeptide 
sequence by Tip. a replacement of Asn at position 146 5 
by De. a replacement of Met at position 153 by Thr. a 
replacement of Val at position 163 by Ala. and a 
replacement of Asn at position 212 by Lys. 

12. Hie composition according to claim 8. 

wherein said fluorescent modified form comprises a 10 
replacement of Tyr at a position corresponding to 
position 66 of said wild-type GFP polypeptide 
sequence by Trp. a replacement of De at position 123 by 
Val. a replacement of iyr at position 145 by His. a 
replacement of His at position 148 by Arg. a replace- 15 
ment of Met at position 153 by Thr. a replacement of 
Val at position 163 by Ala. and a replacement of Asn at 
position 212 by Lys. 

13. The composition according to claim 1. 

wherein said fluorescent modified form exhibits enhanced 20 
emission relative to said wild-type GFP polypeptide 
sequence. 

14. The composition according to claim 13. 

wherein said fluorescent modified comprises a replace- 25 
ment of Ser at a position corresponding to position 65 
of said wild-type GFP polypeptide sequence by an 
amino acid selected from the group consisting of Ala. 
Cys. Thr. Leu, Val and lie. 

15. The composition according to claim 14. 
wherein said amino acid is Cys of Thr. 

16. A functional mutant fluorescent protein, comprising: 
a protein with an amino acid sequence that differs from an 

amino acid sequence of an Aequorea wild type green 
fluorescent protein (SEQ ID NO: 2) by at least one 35 
amino acid substitution that is at position 65. wherein 
said at least one substitution is either S65A, S65C 
S65T. S65L. S65V, or S651. 
wherein said functional mutant fluorescent protein has an 
excitation or emission different from an excitation 40 
spectrum or emission spectrum of said Aequorea wild 
type green fluorescent protein. 

17. The functional mutant fluorescent protein of claim 16. 
wherein said at least one amino acid substitution that is at 

position 65 is S65A. « 

18. The functional mutant fluorescent protein of claim 17. 
wherein said functional mutant fluorescent protein com- 
prises a fusion protein. 

19. The functional mutant fluorescent protein of claim 16, 
wherein said at least one amino acid substitution that is at 

position 65 is S65C. 

20. The functional mutant fluorescent protein of claim 19, 
wherein said functional mutant fluorescent protein com- 
prises a fusion protein. 5J 

21. The functional mutant fluorescent protein of claim 16, 
wherein said at least one amino acid substitution that is at 

position 65 is S65T. 

22. The functional mutant fluorescent protein of claim 21. 
wherein said functional mutant fluorescent protein com- ^ 

prises a fusion protein. 

23. The functional mutant fluorescent protein of claim 16. 
wherein said at least one amino acid substitution that is at 

position 65 is S65L. 

24. The functional mutant fluorescent protein of claim 23. 65 
wherein said functional mutant fluorescent protein com- 
prises a fusion protein. 
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25. The functional mutant fluorescent protein of claim 16. 
wherein said at least one amino acid substitution that is at 

position 65 is S65V. 

26. The functional mutant fluorescent protein of claim 25. 
wherein said functional mutant fluorescent protein com- 
prises a fusion protein. 

27. The functional mutant fluorescent protein of claim 16. 
wherein said at least one amino acid substitution that is at 

position 65 is S65L 

28. The functional mutant fluorescent protein of claim 27. 
wherein said functional mutant fluorescent protein com- 
prises a fusion protein. 

29. The functional mutant fluorescent protein of claim 16. 
wherein said functional mutant fluorescent protein con- 
sists of mutations S65T. M153A. and K238E. 

30. The functional mutant fluorescent protein of claim 29. 
wherein said functional mutant fluorescent protein com- 
prises a fusion protein. 

31. The functional mutant fluorescent protein of claim 16. 
wherein said functional mutant fluorescent protein further 

comprises an amino acid sequence which targets said 
protein to the specific cellular locations. 

32. A functional mutant fluorescent protein, comprising: 
a protein with an amino acid sequence that differs from an 

amino acid sequence of an Aequorea wild type green 
fluorescent protein (SEQ ID NO:2) by at least one 
amino acid substitution that is at position 66. wherein 
said at least one substitution is either Y66H or Y66W. 
further wherein said functional mutant fluorescent pro- 
tein has an excitation or emission different from an 
excitation spectrum or emission spectrum of said 
Aequorea wild type green fluorescent protein. 

33. The functional mutant fluorescent protein of claim 32. 
wherein said at least one amino acid substitution that is at 

position 66 is Y66W. 

34. The functional mutant fluorescent protein of claim 33. 
wherein said functional mutant fluorescent protein com- 
prises a fusion protein. 

35. The functional mutant fluorescent protein of claim 32, 
wherein said at least one amino acid substitution that is at 

position 66 is Y66H. 

36. The functional mutant fluorescent protein of claim 35. 
wherein said functional mutant fluorescent protein com- 
prises a fusion protein. 

37. The functional mutant fluorescent protein of claim 32. 
wherein said functional mutant fluorescent protein con- 
sists of mutations Y66H and Y145F. 

38. The functional mutant fluorescent protein of claim 37. 
wherein said functional mutant fluorescent protein com- 
prises a fusion protein. 

39. The functional mutant fluorescent protein of claim 32. 

wherein said functional mutant fluorescent protein con- 
sists of mutations Y66W. N146L M153T. V163A and 
N212K. 

40. The functional mutant fluorescent protein of claim 39. 
wherein said functional mutant fluorescent protein com- 
prises a fusion protein. 

41. The functional mutant fluorescent protein of claim 32. 

wherein said functional mutant fluorescent protein con- 
sists of mutations Y66W. I123V. Y145H, H148R. 
M153T. V163A. and N212K. 

4Z The functional mutant fluorescent protein of claim 41. 
wherein said functional mutant fluorescent protein com- 
prises a fusion protein. 
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43. A functional mutant fluorescent protein, comprising: 
a protein with an amino acid sequence that differs from an 

amino acid sequence of an Aequorea wild type green 
fluorescent protein (SEQ ID NO:2) by at least one 
amino acid substitution in the region consisting of 5 
positions 65 and 66. 
wherein said functional mutant fluorescent protein has an 
excitation or emission different from an excitation 
spectrum or emission spectrum of said Aequorea wild 
type green fluorescent protein. 

44. The functional mutant fluorescent protein of claim 43. 

wherein said functional mutant fluorescent protein exhib- 
its an alteration in the ratio of two main excitation 
peaks relative to Aequorea wild type green fluorescent 15 
protein. 

45. The functional mutant fluorescent protein of claim 44. 

wherein said functional mutant fluorescent protein exhib- 
its increased fluorescence at the shorter-wavelength 
peak of said two main excitation peaks. 20 

46. The functional mutant fluorescent protein of claim 43. 
further comprising an amino acid substitution that is at 

position 202. 

47. The functional mutant fluorescent protein of claim 46. 
wherein said amino acid substitution that is at position 25 

202 is S202F. 

48. The functional mutant fluorescent protein of claim 43. 
further comprising an amino acid substitution that is at 

position 203. M 

49. The functional mutant fluorescent protein of claim 48. 
wherein said amino acid substitution that is at position 

203 is T203L 

50. The functional mutant fluorescent protein of claim 43. 
further comprising an amino acid substitution that is at 35 

position 167. 

51. The functional mutant fluorescent protein of claim 50. 
wherein said amino acid substitution that is at position 

167 is 1167 V or I167T. 

52. The functional mutant fluorescent protein of claim 43. 40 
further comprising an amino acid substitution that is at 

position 153. 
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53. The functional mutant fluorescent protein of claim 52. 
wherein said amino acid substitution that is at position 

153 is M153Tor M153A. 

54. The functional mutant fluorescent protein of claim 43. 
further comprising an amino acid substitution that is at 

position 238. 

55. The functional mutant fluorescent protein of claim 54. 
wherein said amino acid substitution that is at position 

238 is K238E. 

56. The functional mutant fluorescent protein of claim 43. 
further comprising an amino acid substitution that is at 

position 145. 

57. The functional mutant fluorescent protein of claim 56. 
wherein said amino acid substitution that is at position 

145 is Y145H orY145F. 

58. The functional mutant fluorescent protein of claim 43. 
further comprising an amino acid substitution that is at 

posiuon 146. 

59. The functional mutant fluorescent protein of claim 58, 
wherein said amino acid substitution that is at position 

146 is N146I. 

60. The functional mutant fluorescent protein of claim 43. 
further comprising an amino acid substitution that is at 

posiuon 163 or 148. 

61. The functional mutant fluorescent protein of claim 60. 
wherein said amino acid substitution that is at position 

163 is VI 63 A and said amino acid substitution at 
posiuon 148 is H148R. 

62. The functional fluorescent protein of claim 43. 
further comprising an amino acid substitution that is at 

position 212 or 123. 

63. The functional fluorescent protein of claim 62. 
wherein said amino acid substitution that is at position 

212 is N212K and said amino acid substitution at 
position 123 is 1 123 V. 

64. The functional fluorescent protein of claim 43. 
wherein said functional fluorescent protein comprises a 

fusion protein. 

***** 
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